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Abstract. We show that, in a restricted range, the divisor function of integers in residue classes modulo 
a prime follows a Gaussian distribution, and a similar result for Hccke eigenvalues of classical holomorphic 
cusp forms. Furthermore, we obtain the joint distribution of these arithmetic functions in two related residue 
classes. These results follow from asymptotic evaluations of the relevant moments, and depend crucially on 
results on the independence of monodromy groups related to products of Kloosterman sums. 



1. Introduction 

The distribution of arithmetic functions in arithmetic progressions is one of the cornerstones of modern 
analytic number theory, with a particular focus on issues surrounding uniformity with respect to the modulus 
(see [7] for a recent survey). Besides the case of primes in arithmetic progressions, much interest has been 
devoted to the divisor function d(n) and higher-divisor functions, in particular because in some precise 
sense - a good understanding of a few of these is equivalent to knowledge about the primes themselves (see, 
e.g., [6, Theoreme 4]). 

The consideration of the second moment for primes p < X in arithmetic progressions to moduli q < 
Q < X/(\ogX) A leads to the Barban-Davenport-Halberstam theorem (see, e.g., jT3j Th. 17.2]), which has 
been refined to an asymptotic formula for Q — X by Montgomery [2D]. Similarly, Motohashi [2T] evaluated 
asymptotically the variance of the divisor function d{n) for n < X in arithmetic progressions modulo q < X . 

We will show that one can determine an asymptotic distribution for the divisor function d(n) for n < X 
in arithmetic progressions modulo a single prime p, provided however that X is a bit smaller than p 2 . 

Theorem 1.1 (Central Limit Theorem for the divisor function). Let w be a non-zero real-valued smooth 
function on M with compact support in ]0, +oo[ and with L 2 norm \\w\\. For a prime p, let 

S d {X,p,a)= ^2 

n>l 
n=a mod p 

1 / 77 \ 1 r+°° / T \ 

M d (X,p) = -J2d(n)w(-)~- (logx + 2 7 -2logp) W (-)dx (1.1) 

P n>l & ^° 

= lY, d (Mx-) +0 (w x{logX) )' 



and 



n>l 

where 7 is the Euler constant. For a C , let 

E d (X,p, a) 



S d (X,p,a)-M d (X,p) 
(W /2 
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Let <£>(a;) > 1 be any real-valued function, such that 

$(x) — > +00 as x -> +00, = O e (a; e ), 

for any e > and x > 1. For any prime p, let X = p 2 /$(p). TTien as p — > +00 over prime values, the 
random variables 

E d (X,p,a) 



a 1 — ^ 



||«;||V27r- 2 (log*(p))3 

on Fp , wi£/i i/ie uniform probability on F* , converge in distribution to a standard Gaussian with mean and 



rx 
p 

variance 1, i.e., /or any reaZ numbers a < (3 , we have 

e- 1 / 2 dt 



p-l 



a t= Fp I a< ^ (X ' P ' a) </?} 



' - <! o - II- ' n 

||w||^27r- 2 (log$(p)) 3 ' J p^°° V27T 

In fact, our results are more general, in three directions: (1) we will consider, in addition to the divisor 
function, the Fourier coefficients of any classical primitive holomorphic modular form / of level 1 (e.g., the 
Ramanujan r function); (2) we will compute the moments of the corresponding random variables and, for 
a fixed moment, obtain a meaningful asymptotic in a wider range of X and p; (3) we will also consider the 
joint distribution of 

(E d (X,p,a),E d (X,p,j{a))) 

when 7 is a fixed projective linear transformation (e.g., 7(a) = a + I, 7(a) = 2a, 7(a) = —a, 7(a) = I/a, 
which illustrate various interesting phenomena.) For all these results, the crucial ingredients are the Voronoi 
summation formula, and the Riemann Hypothesis over finite fields, in the form of results of independence of 
monodromy groups of sheaves related to Kloosterman sums. 

We now introduce the notation to handle these more general problems. As in the statement above, we fix 
a non-zero smooth function w : K — > K, with compact support in [u>o,u>i] with < Wo < w% < +00. For 
any modulus c > 1, let 

S d (X,c,a)= ^ d(n)w( 



n>l 

n=a mod c 



This sum has, asymptotically, a natural main term (see, e.g., |17j ) which we denote by M d (X,c), and 
which coincides with M d (X,p) when c — p is prime (see (I2.8P below). The number of terms in S d (X,c,a) 
is w X/c and the square root cancellation philosophy suggests that its difference with the main term should 
be of size 

S d (X,c,a) - M d (X,c) « {X/c)^X\ (1.2) 
as long as X/c gets large. Thus the map 

Z: a £ (Z/cZ) ^E d (X,c,a) = (x/c) 1 / 2 ' 

is a natural normalized error term that we wish to study as a random variable on (Z/cZ) x equipped with 
the uniform probability measure (here and below, we sometimes omit the dependency on p and X to lighten 
the notation Z). 

Similarly, consider a primitive (Hecke eigenform) holomorphic cusp form / of even weight k and level 1 
(these restrictions are mainly imposed for simplicity of exposition). We write 

f{z) ^^2p f {n)n^ l ^ 2 e{nz) 

n>l 

its Fourier expansion at infinity, so that p/(l) = 1 and Pf(n) is the eigenvalue of the Hecke operator T(n) 
(suitably normalized). We let 



S f (X,c,a)= Pf( n ) w (-^), M f (X,c) = - ^Pf(n)w(j£), 

n=a(mod c) n>l 

S f (X,c, a) - M f (X,c) 



E f (X,c,a) 



(X/cfl 2 



for c > 1 and any integer a. Note that, in this case, the integral representation 

/■2+ioo 



M f (X, c) = - x — / u-(s) X s L( fl , f)ds 

C ZIYI J2-ioo 



' 2 — zoo 

in terms of the Mellin transform ii) of ro shows that the main term is very small, namely 

„-i v—A 



M f {X,c) <f, A c-'X- A (1.3) 



for every positive A, uniformly for c > 1 and X > 1. 
We will study the distribution of 

a (->• Ef(X,p, a), BH>E,i(I,p,fl) 

for p prime using the method of moments. Thus, for any integer k > 1, we define 



M 



*(JT,c;«) = - V £*(X,c,a) K , * = dor/. (1.4) 



C 

a mod c 
(a,c)=l 



The first moment is very easy to estimate, and besides Motohashi's work (which considers the average 
of Aid(X, c; 2) over c < X), the second moment has recently been discussed by Blomer [2J, Lu [IB] and 
Lau-Zhao |17j . In particular, Lau and Zhao obtained an asymptotic formula in the range X 1 / 2 < c < X 
(see (|1.10[) below; note that the range c < X 1 / 2 seems to be much more delicate.) 

We will evaluate any moment, in a suitable range. Precisely, in ^21 we will prove: 

Theorem 1.2. Let the notation be as above, with * = d, the divisor function, or * = / . / a Hecke form of 
weight k and level 1. Let p be a prime number. Then, for every integer k > 1, for every positive d, for every 
positive e, for every X satisfying 

2 < X 1/2 < p < X 1 - 5 , (1.5) 

we have the equality 

M4X,p; K ) = C^) + o(p- 1 ^(^y /2 + (^) 1/2+e ), (1.6) 

where the implied constant depends on (S,e, ft, f,w), and the constant C*(k) is given by 

C*(k) = cl[w m K , (1.7) 

with 

if k is odd, 



(1.8) 



2 k / 2 {k/2)\ 



if k is even, 



Cf,w = IH| 2 ||/H 2 l^T, c dtW =P w (\og4), (1.9) 



r(fc) ' w \ °x, 

for some polynomial P W (T) £ R[T], depending only on w, of degree 3 with leading term 2||u>|| 2 7r~ 2 X' 3 . Here, 
for a cusp form f, the L 2 -norm of f is computed with respect to the probability measure 

3 dxdy 
7r y 2 

on SL2(Z)\H, and the L 2 -norm of w is computed with respect to the Lebesgue measure on R. 

Remark 1.3. In the case k — 2, and in the range X 1 / 2 < c < X , Lau and Zhao [T71 Theorem 1 (2)] have 
obtained 

\t\^ £ M^^ + o^W^E^f), (MO) 

a—1 n=a(mod c) t\c 

l<n<X 

for any modulus c > 1 (not only primes), and a similar result for the divisor function. 
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We will make further comments on this result after the proof, in Section 13.51 Since m K is the K-th 
moment of a Gaussian random variable with mean and variance 1, we obtain the following, which implies 
Theorem 1 1.1 1 in the case *= d: 



Corollary 1.4 (Central limit theorem). Let > 1 be any real-valued function, such that 

3>(x) — > +oo as x — »• +oo, $>(x) = £ (x c ), 

for any e > 0, uniformly for x > 1. For any prime p, let X = p 2 /$(p). Then as p — > +oo over prime values, 
the random variables 

Ei,(X,p, a) 
a i y ■ — 

on F* converge in distribution to a standard Gaussian with mean and variance 1. 

As far as we know, this is the first result of this type. We will prove this in Section [3J and give further 
comments, in Section [3~6l 

Among the natural generalizations of this result, we consider next the following one: given a map a 7(a) 
on Fp , what is the asymptotic joint distribution of 

OH- (E 1r (X,p,a),E ir (X,p,'Y(a))) ? 
We study this when 7 is given by a fractional linear transformation. Precisely, let 

7=f! Jf) G GL 2 (Q)nAf 2 (Z) (1.11) 



c 



be a fixed invertible matrix with integral coefficients. For p j det 7, the matrix 7 has a canonical reduction 
modulo p in PGL2(F p ), which we denote by 7^(7). In the usual manner, 7 (or 7^(7)) defines a fractional 
linear transformation on Pp p by 

™i az + b 

By Corollarv ll.41 we know that, in the range of validity of this result, both 

E*{X,p,a) ^(A,p,7-n) 
Z : a h> — and Z o 7 : a >-> — — , (1-12) 



seen as random variables defined on the set 

{a € F p I a, 7 • a 7^ 0, 00} 

converge to the normal law. We then wish to know the asymptotic joint distribution of the vector (Z, Z o 7), 
and we study this issue, as before, using moments. 
For k and A positive integers, let 

M4X,p; K ,X- n ) := - V E+(X,p, a) K E*(X,p, 7 • a)\ (1.13) 

a, 7-a^0,oo 

be the mixed moment of order (k,X). 

In analogy with Theorem 11.21 we will estimate these moments in 2) To state the result, we note that if 
7 is diagonal, there is a unique triple of integers (0^,71,72), such that we have the canonical form 

7 = a 7 ° V 71 > 1 and (71,72) = 1. (1.14) 



72 

We further introduce the arithmetic functions 

^-n("/y)- w(p y )- '^n(«-)-«^), d.15) 

p a ||a * p°||a 

for a > 1, and p a f — for a < 0, p a d = p a d for a < 0. For * = /, we also define the constant 

c / = ||/|| 2 (47r) fe r(fc)- 1 . (1.16) 

Our result is: 
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Theorem 1.5. Let 7 be defined by (jl.ll)j . 

(1) For every integers n and X, for every S and e > 0, for every prime p > Po("f) and X satisfying (jl.5[) . 
there exists C*(fc, A, 7) suc/i that 

^(X,p; K ,A;7) = a(K 1 A )7 ) + C»^-i+ e (^J + (-J J. (1.17) 

(2) If 7 is non-diagonal, then 

C*(k,A,7) = C*(k)C*(A). (1.18) 

(3) 1/7 is diagonal, and written in the canonical form (|1.14l) . i/ien 

if k + A is odd, 

C*(k,A,7) = ^ ^ ^( j(jm K _ i ,m A _ l/ (c^ u; )^" , '(a^ tu , 7 )^, otherwise, (1.19) 



= k. = A mod 2 



where 



c /^7i72,/( / w(7ii)w(7 2 i)dt), 
J —00 

and for -k — d, we have 

2 

/or some polynomial P-y 1 -y 2 ,w{T) £ K[T], 0/ degree < 3 and wii/i coefficient ofT 3 given by 

2 /■ r°° \ 

(jit)w(^t)dt)T 3 . 



n 2 ^7172 



P7,7 2 ,d / W 



In (|1.17l) . £/ie implied constant depends at most on (7, 5, e, K, A), and in (|1.19[) . we make the convention 
that Q v = 1 if v = 0. 



Of course, if 7 is the identity, we recover Theorem 11.21 More generally, we can now determine the joint 
asymptotic distribution of (Z, Z o 7) in the same range as Corollary II .41 

Corollary 1.6. Let $ be a function as in Corollary \l-4\ and let X = p 2 /$(p). Then, for * = / or d, as p 

tends to infinity, the random vector (Z, Z o 7) converges in distribution to a centered Gaussian vector with 
covariance matrix 



1 

1 

1 C 

x 7,111 
^*,7,«J 1 



i/7 is not diagonal. (1-20) 
, if 7 is diagonal, (1-21) 



where the covariance 7iUJ is given by 



g^, w = / w(n x t)w{ l2 t)dt. 

\\ w \\ Jr 

Recall that a pair (X, Y) of random variables is a Gaussian vector if and only, for every complex numbers 
a and j3, the random variable aX + j3Y has a Gaussian distribution (see, e.g., [M] pp. 121-124]). If (X, Y) 
is a Gaussian vector, its covariance matrix cov(X, Y) is defined by 

covOcyW e(x 2 )-e(x) 2 e(xy)-e(x)e(y)\ 

1 ' > - \E(XY) - E(X)E(Y) E(Y 2 ) - E(Y) 2 J ' 1 > 

where E denotes the expectation of a random variable. Recall also that a Gaussian vector (X, Y) has 
independent components if and only if E(XY) = E(X)E(Y), i.e., if the covariance matrix is diagonal (see [T4l 
Theorem 16.4] for instance). Thus from Corollarv ll.6l (noting that p O(i ^0 for any integer a 7^ 0), we get a 
criterion for asymptotic independence of (Z, Z o 7): 
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Corollary 1.7. We adopt the notations and hypotheses of Corollarv \l.b\ Then asp tends to oo, the random 
variables Z and Z o 7 tend to independent Gaussian random variables, if and only if one of the following 
conditions holds: 

(1) If 7 is not a diagonal matrix, i.e., a 1— > 7 • a is not a homothety, 

(2) If "f is a diagonal matrix and 

/oo 
u>(7i*)M72*) dt = 0, 
-00 

(3) If -k = f , 7 is a diagonal matrix in the from (jl.141) . and there exists a prime p and a > 1 such that 
p Q ||7 2 7i and such that 

(p + l)p f (p a )=p f (p)p f (p a - 1 ). 

Remark 1.8. (1) Corollarv ll.7l shows for instance that, for p — > 00, the random variables a H> E+(p 2 /&(p),p, a) 
and a 1— > E±(p 2 /<&(p),p, 7-0) converge to independent Gaussian variables, if 7 is one of the following functions 

7 • a = a + 1, 7 • a = — a, 7-a = l/a. 

The case of 7 • a = 2a is more delicate, since it depends on the value of the integral / + °° w(t)w(2t) dt. 
For instance, this integral is zero when one has the inequalities wq < w\ < 2w$ < 2wi, where as before 
supp(w) C [wo,wi]- The possible dependency here reflects the obvious fact that if n = a modp and d \ n, 
then 2n = 2a mod p and d \ 2n. 

(2) We do not know if any primitive Hecke form / of level 1 exists for which Condition (3) in this last 
corollary holds for some p a \ Certainly the "easiest" way it could apply would be if, for some p, we had 
Pfip) = 0, but the existence of a primitive cusp form of level 1 and a prime p with pf(p) =0 seems doubtful 
(e.g., a conjecture of Maeda suggests that the characteristic polynomials of the Hecke operators T(p) in level 
1 are irreducible.) On the other hand, if we extend the result to forms of fixed level N > 1, it is possible to 
have Pf{p) = for some p (e.g., for weight k = 2 and / corresponding to an elliptic curve.) 

1.1. Sketch of the proof. We will sketch the proof in the case of cusp forms, which is technically a bit 
simpler, though we present the actual proofs in a unified manner. For Theorem 11.21 the crucial starting 
point is the Voronoi summation formula, as in [2j 1 1 7 j . which expresses Ef(X,c,a) for any c > 1 in terms 
of sums weighted by some smooth function of the Fourier coefficients pf(n) twisted by Kloosterman sums 
S(a, n; c). One then sees that the main contribution to this sum comes from the n of size roughly Y — c 2 / X 
(see Proposition ^. ip . 

Considering the K-th moment, we obtain therefore an average over a mod p of a product of k Kloosterman 
sums S{a,ni,p), where all variables ni are of size approximately p 2 /X. The sum over a £ , when the 
variables ni are fixed, can be evaluated using deep results on the independence of Kloosterman sheaves 
(see Proposition I3.2[) . This allows us to gain a factor p 1 / 2 compared with a direct application of the Weil 
bound for Kloosterman sums, except for special, well-understood, configurations of the ni modulo p. These 
configurations lead, by combinatorial arguments, to the Gaussian main term of Theorem 1 1.21 (Note that we 
can take no advantage of the summation over the variables n^, which turn out to have a short range in the 
cases where our result is non-trivial, see Section [XU) 

The study of mixed moments (see Theorem 1 1.5[) has a lot of similarities. The only significant difference 
lies in the study of the independence of Klosterman sheaves, when some of them are twisted by the rational 
transformation 7. However, Proposition 13.21 is general enough to show that these sheaves are dependent if 
and only if we are in the "obvious" cases. The main terms then require some computations of integrals using 
properties of the Bessel transforms. 

1.2. Possible extensions. A Gaussian law similarly appears if one studies the random variable a H> 
E^(X,p, P(a)), where P is a non-constant fixed polynomial with integer coefficients. The fact that P is 
not necessarily a bijection on ¥ p does not affect the Gaussian behavior. The proof of this extension requires 
a suitable generalization of Proposition ^. 21 

It also seems that the present method can be extended to the study of the distribution of sums of the 
shape 

a 1 y S^X^^Ka) = T+(n)K a (n)w(^p) 

n>l 
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where r+(-) is cither d(-) or /?/(•), and K a (n) — K{an) for a fairly general trace function K as in [5]. The 
shape of the analogue of Theorem 11.21 would then depend on the nature of the geometric monodromy group 
of a suitable "Bessel transform" of the sheaves underlying K(-). 

Another natural extension, which we are currently considering, is that of coefficients of cusp forms on 
higher-rank groups, and of higher divisor functions. 

1.3. Notations. We use synonymously the notation f(x) -C g(x) for x £ X and / = 0(g) for x € X. We 
denote e(z) = e 2l7rz for zeC. For c > 1 and a, b integers, or congruence classes modulo c, the Kloosterman 
sum S(a, b; c) is defined by 

SM;c)= £ e (H±») 

x mod c 
(x,c)=l 

where x is the inverse of x modulo c. The normalized Kloosterman sum is defined by 

S{a,b;c) 
Kh{a,b;c) = — — , 

and for (a, 6, c) = 1 it satisfies the Weil bound 

\Kl 2 (a,b;c)\<d(c). (1.23) 

To lighten notations, we define 

Kl 2 (a;c) :=Kl 2 (a,l;c), 
and recall the equality Kl2(a, b; c) = Kl 2 (a&; c), whenever (b, c) = 1. 

We will use the Bessel functions J&_i, where fc > 2 is an integer, Yo and Kq; precise definitions can be 
found for instance in [TTJ App. B.4] and in |24) . 

2. Preliminaries 

We gather in this section some facts we will need during the proof of the main results. The reader may 
wish to skip to Section [3] and refer to the results when they are needed. 
We begin with the Voronoi formula in the form we need: 

Proposition 2.1 (Voronoi summation). Let * = /, for a cusp form f of level 1 and weight k, or * = d. 
Let c be any positive integer, with c prime if -k = d. Then for any X > 1 and for any integer a, we have the 
equality 

E*(X,c,a) = — ^(^) 1/2 ^r4n)^(^)Kl 2 (a,n; Cl ), (2.1) 

ci\c n^O 1 

ci>l 

where n runs on the right over non-zero integers in Z and 

, > I Pf(n) if n > 1, . 

r/WHn «, (2 - 2) 

I (J otherwise, 

T d (n) = d(\n\), (2.3) 

and 

/"OG 

W f (y)=2TTi k w(u)J fc _i(47rVuy)dM fory>0, (2.4) 
Jo 

W/(y) = 0, /or 2/ <0, 

/•cx: 

W d (3/) = -2tt / w(w)y (47r 1 /^)du, /or y > 0, (2.5) 



W d (j/) = 4 / w(u)X (47ry^j)d u , /or y < 0. (2.6) 
Jo 

In particular, if c — p, a prime, we have 

E*(X,p,a)= (4) 1/a £r*(n)W^(^)Kl 2 (a ) n;p). (2.7) 

^ n^O 
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For the proof we recall the standard Voronoi summation formula (see, e.g., [T3J p. 83] for * = / and [TBI 
(4.49)] for -k — d, which we rewrite as a single sum over positive and negative integers instead of two sums). 

Lemma 2.2. Let c be a positive integer and a an integer coprime to c. 

(1) For any smooth function w compactly supported on ]0, oo[, we have 

n>l n>l 

if f is a cusp form of level 1 and weight k. 

(2) For any smooth function w compactly supported on ]0, oo[, we have 

5^d(n)u>(n)e(— ) = - /" + °° (logs + 2 7 - 2logc)w(x)dx + - ^ d{\n\)W d (^j e(-— ) . (2.8) 

n>l c c Jo c n ^ c c 

Proof of Provosition [Ol We consider the case of * = /, the divisor function being handled similarly (it is 
easier since c is prime; the definition (jl.ip of the main term is designed to cancel out the first main term 
in (|2.8p ). Using orthogonality of additive characters, and separating the contribution of the trivial character 
from the others, we write 

*/(*.*.)- i£.(-|)£«<»K£MT) 

6=0 n>l 

n>l 1<6<c-1 n>l 

which yields the expression 

l<6<c-l n>l 
We split the second according to the value of the g.c.d d = (6, c), writing 

d = (6, c), b = dbi, c = dc\, 

and note that 

1 < ci < c, 1 < 6i < ci. 

We then get 

v y d|c 1<6<c n>l 
(6,c)=d 

-p^E E «(-£)S>c»K£M£ 

V ; Cl \c l<6i<ci n>l 
ci>l (6i,ci)=l 

We can now apply Lemma 12.21 since (&i,ci) = 1, and we get 

£«<"K£M£Ke«<»KtM-£ 

n>l n>l 1 

The proposition now follows since the terms with n < are identically zero for this case. □ 

We will need some basic information on the behavior of the Bessel transforms 

Proposition 2.3. Let w be a smooth function with support included in ]0, +oo[. Let W*(y) be one of the 
Bessel transforms of w as defined in Provosition \2.1[ for some integer k > 2 in the case * = / of weight k. 
(1) The function W± is smooth on K x , and for every A > and j > 0, we have 

y j wP(y) « Aj min(l + |log \y\\, \y\- A ), (2.9) 

for y^Q. 



(2) We have 

\\W4 = HI, (2-io) 

where the L 2 -norm ofW+ and w are computed in L 2 (R X ) with respect to Lebesgue measure. 

(3) More generally, for any two non-zero real numbers m and n, we have 

/oo 
w(mt)w(nt)dt. 
-oo 

Proof. (1) (Compare, e.g., with (2] p. 280], pTl Lemma 3.1]) We begin with the case j — 0. For y small, we 
use the bounds 

J k -i(x) <k 1, Y (x) < l + |logat|, K (x)<l + \1ogx\ 
for < a; < 1 which immediately imply 

W*(y)«l + |logM| (2.11) 

in all cases. 

To deal with the case where \y\ > 1, we first make the change of variable 

v = 47r\/ u\y\ 

in the integrals (|2.4I) (resp. (|2.5p . (I2.6[l ). so that we always get 



where -Bo = cJk-i, 0, cYo or cKq, for some fixed multiplicative constant c 6 C. 

We denote a = (167T 2 ?/ 2 )" 1 . To exploit conveniently the oscillations of the Bessel functions Bq we integrate 
by parts, using the relations (see [TQl 8.472.3, 8.486.14]) 

{x v+l Z v+l {x))' = ex v+1 Z u (x), (2.12) 

where 

'+1 HZ v = J v ovY v , 
-1 if Z v = K v 

For -k = /, remembering that to vanishes at and oo, we obtain, for instance, the equality 

Wf(y) = -^- [ (2av 2 w'(av 2 ) + (1 - k)w(av 2 )) J k (v)dv (y > 0). 
|2/| io v y 

By iterating £ > 1 times, and then arguing similarly for * = d, we see that there exist coefficients £^ „ 

such that 

W*(y) = -- (av 2 Yw^(av 2 ))v- e+1 B e (v)dv, (2.13) 

where = Jfc-i+£, 0, or Kg corresponding to the different cases * — f or ★ = d, y > or y < 0. 
Since w has compact support in [wq, wi], the above integral can be restricted to the interval 

X:= [(wo/a^Aw^a) 1 / 2 ], 

and using the estimated 

Jk-i+t{v) V" 1 / 2 , Y e (v) «, w- 1 / 2 , K e (v) «, t;- 1 / 2 
for u > 1, we obtain the inequality 

W*(y) < M^ 1 / tr' + 4d« < | y |-i-^/2+3/a (214) 



for |?/| > 1. Since f > is arbitrary, this gives the result for j = 0. 

We can reduce the general case to j = using the formulas (see |T0j 8.472.2, 8.486.13]) 

xZ' v (x) = vZ v [x) - xZ u+ i(x), 



1 For the last one, one knows in fact that K{(v) decays exponentially fast for v +oo. 
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from which it follows that 

^ . POO nOO rOO 

y dy\J w ( u ) z »(^Vuy) du ) = 2 J w{u)Z v {A-K^/wy)du - Zn^/y J w(u)y/uZ v+1 (4n^/uy)du. 
Applying the previous method to the relevant Bessel functions then leads to 

yWi(y) <*, A min(l + | log \y\\,y~ A ) 

and by induction a similar argument deals with higher derivatives. 
(2) In the case * = /, the identity 

+00 z'+oo 

W f {ufdu= / w{u) 2 du = \\w\\ 2 
Jo 

is a direct consequence of the unitarity of the Hankel transform, i.e., of the Fourier transform for radial 
functions (see, e.g., jTTJ Lemma 3.4]). The case * = d is less classical, although it is formally similar, the 
hyperbolas xy — r replacing the circles x 2 + y 2 = r 2 (see |T3l §4.5]). We use a representation-theoretic 
argument to get a quick proof. The unitary principal series representation p = 7r(0) of PGL2(K) (in the 
notation of [H p. 10]) can be defined by its Kirillov model with respect to the additive character tp(x) = e(x), 
which is a unitary representation of PGL2(K) on L 2 (R X , |;r| _1 Gia;). In this model, the unitary operator 



(G »)) 



T = P 

on L 2 (W X ,\x\^ 1 dx) is given by 

T V {x) = J v (t)J{xi)- 

where J is the so-called "Bessel function" of p (with respect to ip, see |H Th. 4.1]). By |4] Prop. 6.1, (ii)] 
(see also [TJ §6, §21]), we have 



J(u) = 

Hence by (|2.5p and (|2.6[) , we see that 



-2ny^uY (4:TTy/u) for u > 0, 
A\/\u\KQ(&n^/\u\) for u < 0. 



W d (y) = \y\- 1/2 T(<p)(y), where <p(x) = \f W{x) * X > ° (2.15) 

11 x < 0. 



The unitarity of T means that 

/ |T(v>)(y,l|^= / 

i.e., 

\W d (y)\ 2 dy= I \w(x)\ 2 dx=\\w 



, 2 dy _ f 1 / M2 dx 



■r 



+ OO 

2j„ _ ll„„l|2 



(3) We consider different cases. If mn > 0, changing t to —t allows us to assume that m and n are positive. 
Then a simple polarization argument from (|2.10p shows that 

» + OG />00 

W*(mt)W*(nt)dt= / tt) ro (u)ro„ (it)d«, (2.16) 



where w 1— > Xo m (u) is the function for which the Bessel transform of is i 1 — >■ W*(mt) and similarly for Xo n (u). 
But it is immediate that tv m (u) = (1/m) w(u/m), and therefore (|2.16|) gives the result. 

If mn < 0, then since the support of w is contained in [0, +00 [, we have w(mt)w(nt) = for all t, hence 

w(mt)w(nt)dt = 0, 

and we must show that the integral of W i ,{mt)W i ,{nt) is also zero. If * = /, a cusp form, this is immediate 
since Wf(y) = for y < 0, so that Wf(mt)Wf(nt) = for all t. 
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For * = d, we use representation theory as in (2). With the same notation as used there, and for any 
real-number a ^ 0, we denote 

»-<{i :)) 

so that, by definition of the Kirillov model (see [4j §4.2, (4.1)]), we have 

U a {tp)(x) = (p(ax) 
for ip e L 2 (R X , l^l -1 ^). Observe that, in PGL 2 (R), we have 

(0 -l\ fa (A _ f-l \ fO -l\ _ fa- 1 (A fO -1 

^i o y ly v° - a ) v °/ V° v I 1 

hence 

ToU a = U a -ioT. 
Using this and the unitarity of T, we deduce that 

-dx 



{Tv){ax){Tv){bx)— = (U a (T<p),U b (T<p)) = (T{U a -«p),T(U b -«p)) 
\x\ 

= (U a -i<p,U b -itp) 

f / x \ / x \ dx f dx 

= / n~M7-)i-r = / V{bx)ip(ax) — . 
Jr ^a/ \b/\x\ J R \x\ 

Now, applying (|2. 15|) and the fact that Wd is real-valued, we derive 

Wd{ax)Wd{bx)dx = I w(ax)w(bx)dx 
Jm 

for all non-zero a and b. □ 

Remark 2.4. One can also give a direct proof of the last part of this proposition using known properties of 
Bessel functions: the crucial point is that the function 

/>oo 

</>M)= / Y (a^)K (b^y)dy 
Jo 

is antisymmetric, which follows from an explicit evaluation using [101 6.523] and [22l p. 153, 2.34]. Conversely, 
the results for cusp forms can be proved using representation theory, the discrete series representation of 
weight k replacing the representation p. 

Our last preliminary results concern the sums which will give rise to the leading terms in the main results. 
Recall the definitions and (ll.lGp . 

Proposition 2.5. Let p be a prime number, S > a parameter and X > 1 such that 

X 1/2 <p< x 1 - 8 . 

Let Y = p 2 1 X . For * € {d, /}, and for a and b coprime non-zero integers, not necessarily positive, let 

' an \ / bn s 



B*(o,6,y)= ]T r*(an)r*(HW;(™)w*(y). 

(1) If -k = f , we have 



1< | an | , \bn\ <p/2 



B±{a,b,Y) = c fPabf (J w(at)w(bt)dt)Y + 0{Y 1/2+e ) 



for any e > 0. 

(2) If -k = d, there exists a polynomial P ab 6 R[T] of degree at most 3, depending on w, such that 

B d (a, b, Y) = P Qb (log Y)Y + 0(Y? +e ) 
li 



for any e > 0, and with coefficient of T 3 given by 

^2Pab,d(J w(at)w(bt)dtjT 3 . (2.17) 
In both cases, the implied constants depend on (<5, e, *, a, &). 

We will use standard complex integration techniques, and first determine the relevant generating series 
(it is here that it is important that / be a Hcckc cigcnform.) We denote 

F*( S ) = 5>(n) 2 |nr , 

so that 

L(s,fxf) 
f{) C(2s) 

if / is a Hecke eigenform, where L(s, f x /) is the Rankin- Sclberg convolution L-function, and 

PW 9 CW 4 
Fd{s) = 2 «2s-y 

In both cases, -F*(s) extends to a meromorphic function, with polynomial growth in vertical strips, for 
Re(s) > 1/2. It has only a pole at s = 1 in this region (of order 1 if * = /, and order 4 if * = d). 

Lemma 2.6. Let * = / or d, and a, b be non-zero coprime integers. Let 

F*,a,b( s ) = ^2 T f( an ) T f( bn )\ n \~ S - 

If*= f and ab < 0, we have F* ;0; f, = 0. Otherwise, we have 

p s 

p"p\\ab 

In particular, F* ;0; f, always extends to a meromorphic function for Re(s) > 1/2, with polynomial growth 
in vertical strips. 

Proof. One sees immediately that it is enough to treat the case where a, b > 1 and ab ^ 1. Then the 
assumption that (a, b) = 1 allows us to write 

F*,a,b{s) = i^,a6,l(s) 

so that we can further reduce to the case where b = 1, in which case we write .F+.a.i = F*, a - Now, writing 
any integer n^O (uniquely) as n — jm where j > 1 has all prime factors dividing a and m ^ is coprime 
with a, and summing over j first, we get 

F*A*) = E T f (jm)Tf(ajm)\jm\- s 

l<j|a°° (m.a)-l 

= E T fU) T f( a j)j~ s E r /( m ) 2 i m r s 

j'la 00 (m.a) — 1 



f.w(l]E T *'fr") ' E '■/OK/WM - ', 



p|a fe>0 j|o° 



by multiplicativity of r/. 
Now write 



p|a 

the factorization of a. Again by multiplicativity, we get 

j|a°° p|a fc>0 
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Let 

G i =£T / (/)T / (/+>- feS 

for some fixed prime p and integer i > 0. For i > 1 and fc > 1, we have 

Tf(p k+l ) = r f (p k )r f (f) - Tjtf-^Tftf-V, 

and therefore 

G i =T f (p i )G -p- s T f (p i - 1 )G 1 
for i > 1. In particular, the case i = 1 gives 

(l+p- s )G! =r / (p)G , 

which then implies that 

G*-{r f {p) pS + 1 JGo 

for i > 1. Now, since z/ p > 1 by definition, it follows that 

t/(p) t /(p"* _1 ) 



F i ,^8)=F it (8)H(T f ( J r>) 



p s + l 

p\a 

as claimed. □ 
Proof of Provosition [KR Using Proposition 12. 31 (1), we obtain first 

B*(a,b,Y) = B°(a,b,Y) + O^ 1 ) 

where 

B»(o,6,y) = X;r*(onK(6n)W*(^)w;(y). 

n#0 

Let y be the Mellin transform of 







(fK 


§) 







Again by Proposition 12.31 (1), this is a holomorphic function, bounded and decaying quickly in vertical 
strips, for Re(s) > 0. We have the integral representation 

B°(a, b, Y) = -L / F* >oi ,(s)Y >(«)*», 



2«7T 



(2) 



and we proceed to shift the contour to Re(s) = 1/2 + e, for a fixed e > 0. The integral on the line 
Re(s) = 1/2 + e satisfies 

=?- / F* iab (s)Y s cp(s)ds « F 1 / 2 ^ 

lm J(l/2+e) 

where the implied constant depends on (*, a, b, e, w). On the other hand, the unique singularity that occurs 
during the shift of contour is the pole at s = 1. 
If * = /, this is a simple pole and 

res s= i F* )06 (s)F s </j(s) = Y<p(l) res s= i F/ i06 (s). 

Since 

ip(l) = J W f (at)W f (bt)dt = J w{at)w(bt)dt 

by Proposition 12.31 (3), and since it is well-known that 

res s=1 F / ( S ) = ||/|| 2 (4 7 r) fc r(fc)- 1 =c / , 

(from Rankin-Selberg theory, see, e.g., [12l (13.52), (13.53)]), we see that Lemma l2~6l exactly gives the result 
in the case of a cusp form 

If * = d, on the other hand, we have a pole of order 4, and we see that 

res s=1 F dM ( S )Y s ip(s) = YP ab (logY) 
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where the polynomial P a: b has degree at most 3 and has coefficient of T 3 given by 
1 1 
6 C(2)' 

again by Proposition I2.3| (3). This concludes the proof. □ 



-Pab,d(J K W d (at)W d (bt)dty 3 = ^p ab ^J^w(at)w(bt)dt)T 3 



3. Proof of Theorem 11.21 

3.1. First step. Let p be a prime such that the condition (|1.5[> holds. To shorten the notation, we write 

Y=p 2 /X, (3.1) 

which is > 1 under our assumption. We also write simply W = W* depending on whether we treat the case 
of cusp forms or of the divisor function. 
From (|2.7p in Proposition [2Tj we deduce 



pY 



x J2 Kl 2 (ani;p)---Kl 2 (an K ;p), (3.2) 

l<a<p 



which we write in the form 

M*(X, P ;k):=-^(X 1 + X 2 ) (3.3) 

where Si corresponds to the contribution of the (m, . . . , n K ) such that 1 < | rx» | < p/2 for all i and S 2 is the 
complementary contribution of those (m, . . . , n K ) such that |rij| > p/2 for one i at least. 

3.2. Study of S 2 . We first deal with S 2 , which is easy. By symmetry, we may restrict to the case where 
l^i I > P/2- By Deligne's bound 

\p f (n)\ < d(n) (3.4) 
(in the case of a Hecke eigenform /) and the Weil bound (| 1 . 23[) for Kloosterman sums, we have in both cases 

S 2 «( £ d (\^\)\w(Y)\)x^d(\n\)\w^)\y'\ 

|ni|>p/2 ri^O 

Applying (|2.9[) with A > 3, we deduce 

E 2 « X e (Y A /p A ~ 1 )Y K ~ 1 

for any e > and hence 

By assumption, we have p < X 1 s , hence taking A = A(5, k) sufficiently large we prove the inequality 

£ 2 «X-\ (3.5) 
which combined with p.3[) is acceptable in view of the error term claimed in (| 1 .€>[) . 

3.3. Study of Si. The study of Si is the crux of the matter. To handle precisely the sum of Kloosterman 
sums over a in (|3.2[) . which is a sum over a finite field, we will use a deep result in algebraic geometry. But 
first of all, we must prepare the combinatorial configurations of the arguments n\, . . . , n K , in order to be 
able to detect the main term. We shall even put it in a more general setting to cover the proof of Theorem 
11.51 The following definition deals with the decreasing sequence of multiplicities. 

Definition 3.1 (Configuration). Let p be prime. Let (3 :— (J3i, . . . , f3 K ) £ (PGL 2 (F p )) K be a n-tuple of 
projective linear transformations modulo p. There exist an integer v satisfying 1 < v < k, a v-tuple fi = 
(/ii, . . . ,/i„) of positive integers fa satisfying 

Mi > A*2 > • • • > > 1 and p,\ + ■ ■ ■ + fi u = n. 
and v distinct elements (ci, • • • , u v ) G (PGL 2 (F p )) 1 ', such that we have 

{Pi,"' ,Pk} = Wl,"' ,<Tv}, 



and 

\{i : 1 < i < k, fa = <Tj}\ = [ij 
for all j , with 1 < j ' < v. The integer v and the v-tuple (jii, . . . , p, v ) are unique, and the latter will be called 
the configuration of (3, the integer v will be called the length of the configuration and the entries fij its 
multiplicities. 

If all the multiplicities fij are even, we will say that /3 has a mirror configuration. In particular its length 
/i is even. 

In the next proposition, we will see that the asymptotics for a sum of products of Kloosterman sums 
shifted by the projective transformations fa depends only on the configuration of f3, rather than on the 
precise values of the fa. 

Proposition 3.2. let p be a prime. Let k > 1, f3 = (J3i, . . . , fa) £ (PGL2(F p )) K be a n-tuple of elements 
of the projective linear group with associated configuration fi — (/ii, . . . ,fJ, v ). 
Consider the sum 

&( K ,f3,p)= ]T Kh(fa-a;p)---Kh(fa-a;p). 

a mod p 
/3i-a^0 : oo(l<i<K) 

We then have 

6( K ,f3,p)=A(n)p + K ( P 1 i), (3.6) 

where A(fi) is the product of integrals 

A(n) = ( - J (2 cos O f 1 sin 2 0d0"j ■ ■ ■ ( - J (2 cos 6)^ sin 2 ddO^ . 

The product A(fi) is an integer, which is positive if and only if f3 is in a mirror configuration and 
otherwise, in which case we have 

6(K,p,p) = 0(pl). 

Finally we have 

A(2,2,...,2) = l. (3.7) 

This is a generalization of a result of Fouvry, Michel, Rivat and Sarkozy (see Lemma 2.1]), which only 
dealt with the case where the fa are all diagonal and distinct modulo p. 

Proof. By the definition of the configuration, the sum equals 

6(«,/3,p)= Kh(a 1 -a; P y i ---Kh(a„-a;pr», 

(l<i<u) 

where the elements <7j, 1 < i < v, are distinct in PGL2(F p ). 

For £ 7^ p, let Kl be the (normalized) ^-adic Kloosterman sheaf constructed by Deligne and studied by 
Katz in [15]. This is a lisse Q^-sheaf of rank 2 on G mj F p , which has trivial determinant. For some isomorphism 
L '■ Qi ~> C, it satisfies 

t(trace(Frob aiFp |/C^)) = -Kl 2 (a;p) 
for any a £ . Moreover, ]C£ is Lie-irreducible, tamely ramified at with a single unipotent Jordan block, 
and wildly ramified at oo with Swan conductor 1 and with a single break at 1/2. 

Given 7 € PGL2(F p ), let 7*0 be the pullback of JCl by the fractional linear transformation 7 : x H> 7 • x; 
this sheaf is lisse on P Fp — {7 _1 ({0, 00})} and for any a € F p such that 7 • a ^ 0, 00, it satisfies 

t(trace(Frob a , F j7*0)) = -Kl 2 ( 7 • a;p). 

Katz 15j computed the geometric monodromy group of Kl, and showed that it is equal to SL2, and 
coincides with the arithmetic monodromy group of K,£. The same is therefore true for 7*0. 
We make the following: 

Claim. For a\ and 02 distinct elements of PGL2(F p ) and Jz? any rank one sheaf, lisse on some non-empty 
open subset of P F , the sheaves a\K£ <£> and a^ICi are not geometrically isomorphic. 
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Proof. We may assume that a\ — Id and that a = 02 is not the identity. If a is an homothety, the claim was 
proven in [19j Lemme 2.4]. We now reduce to this case. Assume that Kt ® _Sf and <r*0 are geometrically 
isomorphic. Since ££ is of rank 1, its only possible breaks at infinity are integer, and hence /C£® Jz? is wildly 
ramified at 00. So o~*K,i is also wildly ramified at infinity, which means that a ■ 00 = 00. Furthermore, 
K(.® is also ramified at 0, and hence o~*lC£ must also be ramified, which means a ■ = 0. But this implies 
that a is a homothety, and we apply the result of |T5]. □ 

Since the <7,, (i = 1, • • • , 1/) are distinct elements in PGL2(F p ), it follows from the Goursat-Kolchin-Ribet 
criterion (see jTCl Prop. 1.8.2]) that the geometric monodromy group of the tensor product 

is equal to its arithmetic monodromy group and is the full product group 

SL 2 x ■ ■ ■ x SL 2 , 

which indicates an asymptotic independence of the values of the Kloosterman sums Kl2((7i • a;p) as a varies 
over F p such that at ■ a ^ 0, 00, (i = 1, . . . , u). 

Using Katz's effective form of Deligne's equidistribution theorem ([Kl §3.6]), we deduce that 

— E K1 C81 • a > i;pr • • • K1 (A* • «- 1;?)"" = II ^(( 2 ) + o Ml> ... , M „ (p- 1 / 2 ), 

aGFp, a-i.a^0,oo z— 1 

(l<i<i-) 

where the implied constant is independent of p and ust denotes the Sato- Tate probability measure on [0, 7r], 
which is given by 

Hsr(f(fi)) = - I f (0) sin 2 6d6 

(recall that [0,7r] is identified with the set of conjugacy classes of the compact group SU2(C) via the map 

g € SU 2 (C) i — y traced) = 2cos6», 

and that the Sato- Tate measure is the image of the probability Haar measure of SU2(C) under this map.) 
It follows by character theory of compact groups that 

mult(^) = ^st((2 cos OY) 

is precisely the multiplicity of the trivial representation in the /i-th tensor power Std® M of the standard 
2-dimensional representation of SUa(C). In particular, mult(/i) is a non-negative integer, and it is zero if 
and only if fi is odd (this is obvious when writing the integrals; representation-theoretically, mult(/i) = if fi 

is odd because f ^ acts by multiplication by (— 1) M on Std 8p , and mult(/z) > 1 for /i even, because 

Std® M is self-dual so mult(2/x) is the multiplicity of the trivial representation in End(Std® M ), and the identity 
endomorphism gives an invariant subspace; in fact, one can check that mult(2/i) = ( 2m ) /(/i + 1), a Catalan 
number.) 

As a consequence 

V 

A(/ii,--- ,fi„) = JJmult(/Xi), 

i=l 

is a non-negative integer, and it is non-zero if and only if all the fii are even, which corresponds precisely to 
the mirror configuration. Since mult(2) = 1, we also have A(2, • • • , 2) = I . □ 

Remark 3.3. Expanding the Kloosterman sums, we see that 6(n,f3,p) is a character sum in n + 1 variables. 
The proposition shows that this character sum has square-root cancellation, except if /3 is in mirror config- 
uration. As in [8], we see that the structure of 6(k,/3,p) (as a sum of products of Kloosterman sums) is 
crucial to our success, since it reduces the problem to detecting cancellation in the single variable a. 
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If ft = 2 and if /?i(a) = b\a and /82(a) = b 2 a are diagonal, we can use the fact that the Kloosterman sum 
is the discrete Fourier transform of the function x i-> e(x/p) (and 1— > 0) to get 

'x(l-5i&ah 1 



6(2, = £ Kl 2 (M;p)Kl 2 (& 2 a;p) = £ e (: 



by the discrete Plancherel formula. This is essentially a Ramanujan sum, and hence we see that the second 
moment (as in (jTTTOJ) ) does not require such delicate considerations. Moreover, because the error term is 
here <C p^ 1 (instead of p^ 1 ^ 2 ), the error term for the second moment is better than for the others, which 
explains the greater range of uniformity in the formula (|1 . 10[) of Lau and Zhao. More generally, for re = 2 and 
arbitrary /3 1; f3 2 G PGL 2 (F p ), the sum 6(2, (/3i,/3 2 ),p) can be identified with a special case of a correlation 
sum as defined in [51 §1.2], for the trace weight K(n) — e(n/p). The results of [SI Th. 9.1, §11.1] imply the 
statement of Proposition 13 . 21 for re = 2. 



We can now continue our study of the sum Si defined in (|3.3p . Since we have p \ rii, we have 

Kl 2 (arii \p) = Kl 2 (/3j ■ a;p), 
where /3, G PGL 2 (F P ) corresponds to the matrix 



th 




j J (modp). 



We denote (3 = (/?,, . . . , We also denote by /x(/3) the configuration of (3. Thus, by Proposition 13.21 and 
by (12. 9j) . we have the equalities 

fh=p E-E AfaW) T*(m)---T*(n K )w(^)...w(^) 

l<|ni|,...,|n«|<i>/2 

+o( P *( y d ^\ w (y)\T) 

l<|n|<p/2 

= pX ljM + 0(p^Y K ), (3.8) 

say, for any e > 0. 

Collecting (|3.3j) . (|3.5p and (|3.8p . the proof of Theorem ll.2l is already complete when re is odd, since trivially 
Si.m = in that case. 

3.4. Study of £i,m for even re. Remark that, by the definition of Si, we have the congruence n$ = 
rij mod p if and only if ft; = rtj. In the summation over n = (rii, . . . , n K ) defining E^Mj we can restrict the 
summation over the set of n such that the associated (3 is in mirror configuration by Proposition 13.21 

We now show that, in fact, the main contribution comes from the n in mirror configuration such that the 
configuration of the associated (3 is (2, 2, . . . , 2). It is easy to see that, for the remaining n, the associated 
configuration fi — (jj,±, . . . , fj, u ) is such that the length v is at most re/2 — 1 distinct elements, and satisfy 
fl! > 4. 

The equality Q3.7P and some combinatorial considerations lead to the following equality 
S 1)M = 3-5---(re-l)( Yl T ^ n f W {YT 

l<|n|<p/2 



O 



E E n E wK£) 



.l<f<f-l /Jl>-">AV>2 i=ll<\n\<p/2 
Hi H — h u —k 



= m K ( Y T*V?w(±)y +0{Y""- 1 +<) (3.9) 
l<|n|<p/2 

for any e > 0, the error term arising easily from (|2.9p (recall that m K is given by (I1.9P and is the re-th moment 
of a standard Gaussian). We therefore see that the proof of Theorem 11.21 is completed by combining (13.31) . 
(|3.5j) . ()3.8j) and (|3.9|) together with Proposition ^. 5) applied with a = b = 1. 
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3.5. Further remarks. We compare here the estimate of Theorem 11.21 with other bounds for the moments 
which can be derived straightforwardly from earlier results. For simplificity, we restrict our attention to the 
case of cusp forms. 

First, we note that it is fairly easy to deduce from Proposition 12.11 and from Proposition 12.31 that 

E f (X,c,a)<^ f ^d(c) 5 / 2 , (3.10) 

for any c > 1, X > c and any integer a. When c < X 2 / 3 , this statement is better than the bound 

S / (X,c,a)«X 1 /2+^i/2 

coming from Deligne's estimate for pf(n) (this is very similar to the result first proved by Smith [23l (4)] 
which has the same range of uniformity; see also the remarks in [2, p. 276] and the work of Duke and 
Iwaniec [SJ Th. 2]). Combining these two bounds in the definition (|1.4[) of A4, we obtain 

M f (X,c; K )< e X^-j min(l,-^-J . 
However, for k > 2, we can also write 

M f (X,c;n) < ( max \E f (X,c,a)\Y (- V \E f (X, c, a)\ 2 ), 

Va mod c / \C — 4 J 



. C 

a mod c 



and then using the result (I1.10[) of Lau and Zhao, we deduce a second inequality 

/c 2 \ k/2-1 

M f (X,c; K )^ e X^-) , (3.11) 

which holds uniformly for X^ < c < X. We then see that our result in Theorem 11.21 for c = p a prime, 
improves (|3 . for 

xi < p < xi and k > 3. (3.12) 

We conclude by noting that Theorem 11.21 can be extended without much effort to cusp forms / of arbi- 
trary level and nebentypus, which are not necessarily Hecke forms. On the other hand, it does not seem 
straightforward to extend the result to an arbitrary composite modulus c > 1 . 

3.6. Proof of Corollary 11.41 Corollary 11.41 is an easy consequence of the fact that convergence to a 
Gaussian can be detected by convergence of the moments to the Gaussian moments (see, e.g., [3l Th. 8.48, 
Prop. 8.49]). For p prime, let 

X = J3 2 /$(p), $(p) +00, $(p) < p f \ 

Denoting 

r , Y \ 1 (MX,P,a)Y 



P v ^ \/C+,w 

we see from Theorem 1 1 . 2 1 that for any e > 0, we have 

M*(X,p; k) = m K + o(§{p)- 1/2+t + p-? +t $(p) K/2 ' 
as p — ?• +oo. Since this holds for any fixed integer k > 1, this finishes the proof. 

Remark 3.4. (1) If X = p 2 ~ 5 for some fixed 8 > 0, we can not prove the Central Limit Theorem, but 
nevertheless, we still deduce that the K-moments converge to Gaussian moments when 

1 < K < - . 

id J 

(2) In this result, the Gaussian moments arise in Proposition 13.21 and in fact the combinatorics of the 
computation is the same as in a standard case of the Central Limit Theorem, namely the convergence in 
distribution to a standard Gaussian of a sequence 

2cos(Xi) H h 2cos(X„) 



Ah 



Yn 
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where the (Xj) are independent random variables (defined on some probability space) distributed on [0, 7r] 
according to the Sato- Tate measure. 

(3) It is natural to expect that an asymptotic formula 

M4X, P ;k,)~C4k), (3.13) 

should be true uniformly for any even k, and 

for some fixed 8 (0 < 8 < 1/2), which (with a corresponding upper-bound for the odd moments) would 
extend Corollary 11.41 to this range. This conjecture is true for k = 2 (by (|1.10[) 'l . and is in agreement with 
the square root cancellation philosophy (|1.2[) . 

Another partial indication in favor of this conjecture is that a lower bound of that size holds: considering 
* = / for simplicity, and taking k > 2 even, we have 

Mf(X,p;2)<(Mf(X,p; K ))" ■ (~ E l)*™, 

l<a<p 

and, by combining this with (jl.lOj) . we obtain the lower bound 

Mf(X,p;K) > 1 

uniformly for X^ < c < X 1 ^ 6 . 



4. Proof of Theorem 11.5 



The proof of this Theorem has many similarities with the proof of Theorem 11.21 particularly in the 
computation of the error terms. We will mainly concentrate on the study of the main term of the mixed 
moment Al*(X,p; k, A; 7). 

We suppose that (|1.5[) is satisfied and that p is sufficiently large in terms of 7. We start from the definition 
(|1.13p and apply the same computations leading to (|3.2[) . Q3.3[) and (|3.5II to write the equality 

W KK ,A; 7 ) = 4 i r(E 3 + O i (l- 1 )) (4.1) 
pY 2 



where 



s 3= E-E *)-^KF(y)'^(f) (4.2) 

l<|mi|,...,|m K |<p/2 



l<|n 1 |,...,|n A |<p/2 

x 2^ Kl 2 (mi a; p) •• -Kl 2 (m K a; p)K\ 2 {n\ (7- a) ;p)---Kl 2 (nA (7- °) IP)- 

l<a<p 
a,7-a^0,oo 

Since p divides none of the m, or rij, we see that the inner sum over a is equal to 6(/c + A,/3,p), as defined 
in Proposition 13.21 where 

P = (Km h mK , h ni o 7, . . . , h nx o 7) , (4.3) 

and ft m denotes the homothety 

™ ;)gpgl 2 (f p ). 

To apply Proposition 13.21 we have to understand which (3 are in mirror configuration, in the sense of 
Definition 13.11 This depends on whether 7 is diagonal or not. 

19 



4.1. When 7 is not diagonal. If 7 is not a diagonal matrix, then 

h mi ^ h nj o 7 

for any i = 1, . . . , k and for any j = 1, . . . , A. Hence, in that case, the configuration of /3 defined by (|4.3[) has 
(before ordering the elements by decreasing order) the shape 

= (Mir" .AtfjAt'u" - 

where 

fl = ([!!,••■ ,Hv), H' = (/i'l,--- 

are the configurations of 

(/i mi ,...,/i m J, (/i ni 07,...,^ 07), 
respectively. It follows from Proposition 13.21 that 



6(k + \,/3,p) = Kl 2 (mia;p) • • • Kl 2 (m K a;p) Kl 2 (711(7 • a);p) ■ ■ •Kl 2 (n A (7 • a);p) 



l<a<p 
a, 7-07^0,00 



= A(/x)A(//)p + O k ,a(^). (4.4) 
Hence by (|4.1|) , (14.21) and (14. 4[) and by computations similar to those we did in H3.31 we deduce the equality 

M*(X,p;K,\^) = Y-^(p 3t M(K)V a , M (\) + 0(p-i +e Y K+x f) +0(p- 1 ), (4.5) 



with 



S 3 ,m(«) = r,(mi) • • • r *(m K )w(^) ■ • • w(^) A{n), 

l<|mi|,...,|m re |<p/2 

S 3 ,m(A)= 7-*(»i)---T*(nx)^(2l)-..w(^)A(M / ). 



l<|m|,...,|n A |<p/2 

If k or A is odd, the product A(fi)A(fi') is zero, hence (|1.18l) follows in that case. If k and A are both 
even, then as in (|3.9[) . we prove that the largest contribution comes from the case where fi = (2, . . . , 2) and 
{a! = (2, . . . , 2). Hence, by a computation similar to (|3.9j) and (|1.6j) . we get the equality 

S 8 ,m(«) = {a(K) + 0(F-2+^)}yf, 
and a similar one for £3,^ (A). Hence, by (|4.5|) . we complete the proof of (|1.18[) . 
4.2. When 7 is diagonal. We then write 7 in the canonical form (11.14[) and we suppose that 

P > max(|7i|, I72Q. 

Then, by making the change of variable a — 72 a', we find that the sum over a of normalized Kloosterman 
sums appearing in the last line of (|4.2p is equal to &(n + \,f3,p) as defined in Proposition 13.21 with 

(3 (^7 2 mi j • ■ • j ^72m K ) ^Tmi > ■ • - > h"fin\ ) ■ (4-6) 

If the configuration of (3 is not a mirror configuration, we have 

e(K + \,p,p) = o(p*). 

In particular, if k ^ A mod 2, we deduce by (|4.2p . (|2.9|) and by similar treatment of the error terms as 
above, that 

£ 3 <p^y*+\ (4.7) 
Combining this with (|4.1I) we complete the proof of (|1.19[) when k and A have opposite parity. 
Now assume that k and A have same parity. The combinatorics involved is then more delicate than in 

H4.ll because me must take into account the cases of crossed mirror configurations, namely situations when 

some of the 72^ are equal to some of the 71 nj. 

To be precise, we can decompose £3 (see (|4.2|l ) into 

£ 3 = B nm + B™ + ( 4 - 8 ) 

0<f<min(re ) A) 
v=k=\ mod 2 
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where 

• B nm corresponds to the contribution of the (72^1, . . . , 72W K , 7ini, . . . , 7i^a) which are not in mirror 
configuration, 

• Bq 1 corresponds to the contribution of the (72WH, 72W«, 7i^i, 71 n\) which are in mirror 
configuration, but that configuration is not (2, . . . , 2), 

• B m (u) corresponds to the contribution of the (72TI1, . . . , 72Ti K , 71^1, • ■ • , Jinx) which have a mirror 
configuration equal to (2, . . . , 2), and where exactly v of the 72777^ (1 < i < k) are equal to v of the 
n u rij (1 < j < A). 

The same computation as for (|4.7|) gives the relation 

B nm < p? +£ Y K+x , 

which, when combined with (I4.1[) . fits with the error term in (jl.171) . 

We can also estimate B™ by following the same technique which led to the error term in (|3.9p , and obtain 

B™^pY^~ 1+e , 

which, by (|4.1j) is absorbed by the error term in (|1.17[) . 

The case of B m {v) is more delicate to treat. For the terms in that sum, exactly v of the 72"!^ (1 < i < k) 
are equal to v of the 7i?ij (1 < j < A), and the remaining 72771^ (resp. Jirij) are in configuration (2, . . . , 2). 
The condition 72777,; = JiUj can be parametrized by m t = 71 i and rij = 72^ where t is a non-zero integer. 
Appealing to Proposition 13. 2[ and applying some combinatorial considerations, we deduce the formula 

l<|7i*h l72t|<p/2 



x (1- 3- ■■(«-!/-!))( ^ r;(m)M/ 2 



l<|m|<p/2 



x(l.3...(A-,-l))( ^ ^W^)) r +0( p H^). (4.9) 

l<|n|<p/2 

In this expression, the first term corresponds to the choice and to the contribution of the v integers m; and 
v integers rij which satisfy the condition 72 nij = 71 rij. The second factor corresponds to the contribution 
of the k — v remaining m, which are in configuration (2, . . . , 2) between themselves, and the third factor to 
the A — v remaining rij in configuration (2, . . . , 2) between themselves. Finally, the error term comes from 
the error term in (I3.6[) . 

Using the arithmetic sums B±(m 1 n,Y) defined in Proposition 12.51 we can thus summarize (|4.9p in the 
form 

B m (v) — p — — — — ~vLt777T -B*(l,l ) y) fl * i -"B*(7i 1 ntf" + 0(pi+*Y s &). (4.10) 

v y 2-jh-" ((« - u)/2)\ ((A - v)/2)\ 

We now obtain (ITT9|) by combining (|3Tj) . (|4~Tj) . (|4~8l) . (|470|) and Proposition [231 



5. Proof of Corollary 11.61 

We now deduce Corollary 11.61 from Theorem 11.51 The probabilistic tool is the following standard lemma: 

Lemma 5.1. Let (X„,Y„) be a sequence of real-valued random variables. Let Q be a positive definite 
symmetric 2x2 matrix. Suppose that, for any integers A, k > 0, we have 

E(X£Y£) — ► m K , A (Q) 

as n — > +00, where m K> x(Q) = E(A K B A ) for some centered gaussian vector (A, B) with covariance matrix 
Q. Then (X„,Y„) converges in law to (A, B). 

This follows from the case of individual sequences using the characterization of the Gaussian vector (A, B) 
by its linear combinations aA + /3B being Gaussian. 
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We apply this lemma to the sequence (Z p , Z p o 7) for p prime, as in the statement of Corollarv ll.6l Note 
that if ★ = d, the main term Cd{n, A, 7) still depends on p (because of the polynomials of (\ogp 2 /X) which 
it involves). However, under the assumptions of Corollarv ll.6l on X and p, we see that in all cases, for fixed 
k > and A > 0, the limit 

T - lim C *^ X ^) 

exists, and that 

lim E(Z?(Z p o 7 ) A ) =L K>X . (5.1) 

p— f+00 ^ 

If 7 is not diagonal, we get by (|1.18[) and (|1.7I) that L K >, = wi K mA which coincides obviously with the 
mixed moment E(A"B A ) where (A,B) are independent centered Gaussian variables with variance 1, so we 
obtain Corollarv ll.6l in that case. 

If 7 is diagonal, we must check that L Kj a corresponds to the mixed moments of a gaussian vector (A, B) 
with covariance matrix given by (jl.21l) . For this purpose, we use the formula (|1.19[) and note that 



>+A)/2 

as p — > +00, with notation as in (jl,19|) and Corollarv ll.6l Thus, abbreviating G — G+ t j )W , we compute the 
2-variable exponential generating series of L Kt \ by writing 

^,A>0 /c,A>0 0<i^<min(K,A) v y x y 

v=k=\ mod 2 

^ G»(UVy ^ m 2k U 2k ^ m 2l V 21 f U 2 V 2 

= ^ v\ ^ ~(2fc)F ^ ~(2l)r = eXP (^~ ~2~ 

i/>0 fe>0 v ; 2>0 v 7 

Since this is well-known to be the exponential generating series of the moments of the Gaussian vector 
with covariance matrix (jl.2ip . we obtain the desired convergence in law. 
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